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Abstract 

In this paper, we study performance limits of sensor localization from a novel perspective. Specifically, 
we consider the Cramer-Rao Lower Bound (CRLB) in single-hop sensor localization using measurements 
from received signal strength (RSS), time of arrival (TOA) and bearing, respectively, but differently from 
the existing work, we statistically analyze the trace of the associated CRLB matrix (i.e. as a scalar 
metric for performance limits of sensor localization) by assuming anchor locations are random. By the 
Central Limit Theorems for ?7-statistics, we show that as the number of the anchors increases, this scalar 
metric is asymptotically normal in the RSS/bearing case, and converges to a random variable which is 
an affine transformation of a chi-square random variable of degree 2 in the TOA case. Moreover, we 
provide formulas quantitatively describing the relationship among the mean and standard deviation of the 
scalar metric, the number of the anchors, the parameters of communication channels, the noise statistics 
in measurements and the spatial distribution of the anchors. These formulas, though asymptotic in the 
number of the anchors, in many cases turn out to be remarkably accurate in predicting performance 
limits, even if the number is small. Simulations are carried out to confirm our results. 
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I. Introduction 

Wireless sensor networks have a wide range of applications nowadays, including military operations, 
medical treatments, environmental sensing, water quality monitoring and many others 12, O. Location 
information plays a vital role in those applications, for it is useful to report the geographic origin of 
events, to assist in target tracking, to achieve geographic aware routing, to manage sensor networks, to 
evaluate their coverage, and so on. A sensor network generally consists of two types of nodes: anchors 
and sensors. Anchor locations are known a priori through GPS or manual configurations, while sensor 
locations are not known and need to be determined through the procedures of sensor localization. Up 
to now, considerable efforts have been invested into the development of localization algorithms, see e.g. 
|[3l , |4l , 151 , 151 , 0. HI, 0. Take trilateration, the most basic localization technique, for example: in 
a two-dimensional plane, the location of a sensor is estimated from the known locations of at least 
three non-collinear anchors and the measured distance, e.g. from received signal strength (RSS) or time 
of arrival (TOA), to each anchor; this is also termed as single-hop distance-based sensor localization. 
As an extension, in the multi-hop case where not every sensor directly refers to a sufficient number 
of anchors, iterative trilateration is proposed by using already localized sensors as pseudo-anchors (5J. 
Besides, approximate sensor localization can be realized using mere connectivity data between pairs of 
neighboring nodes, namely connectivity -based sensor localization, see e.g. I3l, |4ll. 

Single-hop sensor localization can be found in many practical localization scenarios, such as source 
localization and target tracking. Moreover, in simultaneous localization and mapping (SLAM) iflOl , ifTTI . 
a mobile robot equipped with a GPS receiver moves in a two-dimensional environment, measures relative 
location information to various objects, and then determines the locations of these objects; herein, the 
positions where the robot makes measurements can be abstracted as anchors, such that the localization 
procedure is single-hop. In lfl2l . |[T3l . a mobile anchor(s) is used to assist in sensor localization by 
providing relative location measurements to sensors at multiple positions, which is evidently single-hop. 
Therefore, it is meaningful to study single-hop sensor localization. 

Apart from localization algorithms, the performance limit of sensor localization, namely the lowest 
achievable error bound for location estimates, also attracts much attention. On the one hand, it provides 
a measure of theoretically optimal performance no matter what sensor localization algorithm is applied; 
on the other hand, it reflects fundamentals of sensor localization. Since the Cramer-Rao lower bound 
(CRLB) establishes a lower limit (or bound) on the variance for any unbiased estimator, it has been 
widely used in the performance analysis of sensor localization, see e.g. |[T4ll . (T5J, lfT6ll . 
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For single-hop sensor localization in a two-dimensional plane, the CRLB is a 2 x 2 matrix and turns 
out to be dependent on multiple factors, including measuring techniques, noise statistics of measurements 
and sensor-anchor geometries (i.e. relative node locations, or their coordinates). Since the trace of the 
CRLB matrix is the minimum mean square estimation error (MSE), it is often used as a scalar metric 
for the performance limit Ifl4l . ifTTl . Provided that the measuring technique and the noise statistics 
of measurements are both known, the scalar metric can be regarded as a function of the sensor-anchor 
geometry. A valuable problem arises to be seeking to minimize the scalar metric, equivalent to identifying 
optimal sensor-anchor geometries for sensor localization, and has been widely studied |fT8ll , |[T9l , |[20l , 
lf2TTl . Moreover, since the scalar metric can be infinite, implying that a localization problem is badly 
conditioned (e.g. the anchors being or nearly being collinear with the sensor) and localization algorithms 
almost fail, we should avoid the situations where the scalar metric takes large values. 

The conventional CRLB studies assume a deterministic sensor-anchor geometry which is normally 
unobtainable for a real localization problem. Instead, a probability measure for the sensor-anchor geometry 
might be available. For example, prior to deploying anchors into a field, we can assume a random and 
uniform distribution for the anchors' positions. Hence, we are motivated to study the scalar metric based 
on a statistical sensor-anchor geometry modeling method. This method is different from the modified 
CRLB in the estimation of nonrandom parameters in the presence of unwanted (or nuisance) parameters, 
see e.g. |[22l . ll23l ; to be specific, the nuisance parameters considered in the modified CRLB are random in 
real estimation problems, whereas the anchor positions in our case are fixed and known in real localization 
problems but are artificially randomized to obtain a better understanding of localization performance. To 
the best of our knowledge, this method has never been considered. 

Additionally, the following considerations also motivate this study. Firstly, supposing that every possible 
sensor-anchor geometry is equi-probable, the distribution of the scalar metric offers a broad, statistical 
view on the performance of single-hop sensor localization, in contrast to one deterministic quantity for 
a given sensor-anchor geometry. For instance, if the scalar metric hardly takes large values, there is less 
reason to worry about the sensor-anchor geometry; otherwise, one must impose proper control on it. 
Secondly, the mean of the scalar metric further establishes a lower limit on the performance of single- 
hop sensor localization given a fixed number of anchors with undetermined locations; in the situations 
where sensor-anchor geometries are unknown, e.g. prior to system deployment, this performance limit is 
certainly useful. Lastly, the statistical sensor-anchor geometry modeling method not only provides insights 
into single-hop sensor localization and in turn guides us in the design and deployment of wireless sensor 
networks, but also as a prototype paves the way for dealing with more complicated scenarios of sensor 
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localization. For instance, in a mobile environment, as may arise with ad-hoc networks, SLAM, mobile 
anchors assisting in sensor localization and so on, it is trivial to concentrate on localization performance 
in one particular time instant, whereas it is evidently more attractive to grasp the knowledge about 
the average localization performance over a period of time and/or in a wide region. Hopefully, these 
challenges can be addressed by the statistically modeling method. In summary, statistical sensor-anchor 
geometry modeling is a powerful method for investigating the performance limit of sensor localization. 

In this paper, we take into account RSS-based, TOA-based and bearing-only localization, respectively, 
and show that the scalar metric in each case is essentially a function of [/-statistics [see Section III-DI 
for further details]. Based on the theory of [/-statistics, we make the following contributions: (i) it is 
proved that as the number of the anchors increases, the scalar metric is asymptotically normal in the 
RSS/bearing case and converges to a random variable which is an affme transformation of a chi-square 
random variable of degree 2 in the TOA case; (ii) the convergence rate in the RSS/bearing case is shown 
to be as fast as 0(n~z), where n is the number of the anchors; (iii) the asymptotic formulas for the 
mean and standard deviation of the scalar metric are derived in both cases; (iv) last but not the least, 
these formulas are analyzed to demonstrate some properties of sensor localization and the conclusions 
are verified by extensive simulations. Although the derived formulas are asymptotic in the number of 
the anchors, in many cases they are remarkably accurate in predicting the performance limit of sensor 
localization, even if the number of the anchors is small. For instance, when the number of the anchors 
is as small as 6, the formula for the mean of the scalar metric in the TOA case is capable of providing 
accurate predictions. 

The remainder of this paper is organized as follows. The next section introduces the problem of single- 
hop sensor localization using the RSS measurements and formulates the scalar metric. Section UlTl presents 
the main results in the RSS case and extends them in the bearing case. Section HVl shows the results in 
the TOA case. Finally, we conclude this paper and shed light on future work in Section |V] 

II. Problem Formulation 

In this section, we first formulate the scalar metric for the performance of single-hop sensor localization 
using RSS measurements and then define a random sensor-anchor geometry model. Throughout this paper, 
we shall use the following mathematical notations: {-) T denotes transpose of a matrix or a vector; Tr(-) 
denotes the trace of a square matrix; Pr{-} denotes the probability of an event; E x {-) and Std x {-) denote 
the statistical expectation and standard deviation with respect to the subscripted variable x. 
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Fig. 1. Single-hop sensor localization. 



A. Single-hop Sensor Localization using RSS Measurements 

In a two-dimensional plane, consider a single sensor (or source, target) located at the origin and n 
(distance or angle) measurements made to this sensor at n known locations, as illustrated in Fig. Q] Here, 
the n known locations are abstracted as anchors and are labeled 1, • • • , n with the i-th anchor's location 
denoted by a^; the true distance between the sensor and the i-th anchor is denoted by di = ||aj||; the 
true angle subtended by a^ and the positive x-axis is denoted by on. 

Take a distance-based localization problem for example: at least 3 non-collinear anchors are required, 
namely n > 3; pair- wise distance measurements {di, i = 1, • • • , n} between the sensor and the n anchors 
are made and obey certain measurement models; then, the task is to find an estimate of the true sensor 
location using the observable set of distance measurements {di, i = 1, • • • , n}. 

Without loss of generality, we let the sensor be a transmitter and the n anchors be receivers. Denote 
by {Pi,i = 1, • • • , n} the measured received powers at the n anchors transmitted by the sensor, which 
satisfy the following assumption. 

Assumption 1: The wireless channel satisfies the log-normal (shadowing) model and the received 
powers {Pi, i = 1, ■ ■ ■ , n} at the n anchors are statistically independent. 

Remark 1: Assumption Q] is the basis for converting the RSS measurements (i.e. received powers) to 
distance estimates 11241 . and is commonly made in both theoretical studies (e.g. ||25l , ||20l , ||26l , ||27l ) 
and experimental studies (e.g. lfl4l . E8l ) on RSS-based sensor localization. It follows that 



Pi(dBm) = Po(dBm) - 10a log 10 + Z, 




(1) 
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where i-'j(dBm) = 101og 10 Pi, i-b(dBm) is the mean received power in dBm at a reference distance Rq, 
a is the path-loss exponent, and Z is a random variable representing the shadowing effect, normally 
distributed with mean zero and variance a 2 dB . As pointed out in ll29l . due to the fact that the log-normal 
model does not hold for d% = 0, the close-in distance Rq is introduced as the known received power 
reference point, and is virtually the lower bound on practical distances used in the wireless communication 
system. Further, Po(dBm) is computed from the free space path-loss formula (see, e.g. (291). 

B. A Random Sensor-Anchor Geometry Model 

To realize our analysis, we define a random sensor-anchor geometry model by assuming 
Assumption 2: The n anchors are randomly and uniformly distributed inside an annulus centered at 
the sensor and defined by radii Rq and R (R > Rq > 0). 

Remark 2: In Assumption [2j R is the upper bound on practical distances which is normally restricted 
by the factors determining path-loss attenuations; the lower bound, though set as the close-in distance 
Ro, is mainly devised to avoid the inconvenience in calculations, and theoretically speaking, can be any 
arbitrarily small positive number. By Assumption |2j each possible sensor-anchor geometry is as probable 
as another, in the sense that the sensor-anchor geometry follows a "uniform" distribution. Besides, {di, i = 
1, • • • , n} and {aj, i = !,-•• , n} are mutually independent with probability density functions (pdfs) 



fd{x) 



fa(x) 



0, others. 



i 0<x<2vr; 

2w ~ (3) 
0, others. 



C. The Scalar Metric 

Denote by (x, y) the unbiased position estimate of the sensor as well as estimation error (because this 
sensor is located at (0, 0)). Clearly, x and y are dependent on v and oj, where 

v = {Pi,i = !,-■■ ,n}, (4) 

uj = {di,ai,i = 1, • • • ,n}. (5) 

The pdf of Pi can be formulated as 

10 ex p{-K ln ft+^ ln t) 2 }' x>0; 



(In 10) %/27r<7 dB Pi 

0, X < 0. 



(6) 
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( ^ 10 ) ■ Then, we can formulate the Fisher information matrix (FIM) F&ss as follows. 



v _ - ,n cos a, v-m cos otj sin ctj 

F RSS = b\ ^ =1 ^r ^=i—Jf- |. 

En cos a, sin a, v-vn sir 

A detailed derivation can be found in lfl4l . If Frss is non-singular, the CRLB on the covariance of 
(x,y), denoted Crss> equals F^ s and satisfies 

E v {x 2 + y 2 ) >Tr{C RSS )- (8) 
That is, Tr(Cjtss) is the scalar metric for the performance limit with the expression (see Appendix IA1: 

\l^l<i<j<n y 

Remark 3: Since Tr(Cnss) is a function of random variables co, Tr(CRss) itself is a random variable. 
As such, by the distribution of Tr(Cjiss), E u (Tr(Cjiss)) an d Stduj(Tr(CRss))> we can statistically 
investigate the performance limit of single-hop sensor localization over a family of random anchor 
locations other than a specific localization problem with given anchor locations. Taking expectations 
with respect to u on both sides of ([8]), we can obtain 

E v ^{x 2 + f) > E u (Tr(CRss)), (10) 

namely that Eu(Tr(Ciiss)) is a lower limit on the performance of localizing one sensor if the spatial 
distribution of a fixed number of anchors (with undetermined positions) is known. Considering the 
involvement of E u (Tr(Cjiss)) an d Stdu(Tr(Cjiss))> we derive a theorem stating their convergence. 

Theorem 1: Let Tr(Cjiss) be defined by Q. 

. If n > 5, then E w (Tr(C RSS )) < oo; 

. If n > 9, then E^TriC^s)?) < oo. 

Proof: See Appendix iBl ■ 

D. U -statistics 

[/-statistics are very natural in statistical work, particularly in the context of independent and identically 
distributed (i.i.d.) random variables, or more generally for exchangeable sequences. The origins of the 
theory of U -statistics are traceable to the seminal paper 11301 , which proved the Central Limit Theorems for 
[/-statistics. Following the publication of this seminal paper, the interest in this class of statistics steadily 
increased, crystallizing into a well-defmed and vigorously developing line of research in probability 
theory. The formal definition for [/-statistics is presented as follows: 
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Definition 1: Let {Xi,i = 1, • • • , n} be i.i.d. p-dimensional random vectors. Let h(x±,--- ,x r ) be 
a Borel function on M rxp for a given positive integer r (< n) and be symmetric in its arguments. A 
U -statistic U n is denned by 

u n = rl{n ~ r)l ]T Kx ii: --- ,x ir ) (ii) 

l<ii <---<i r <n 

and /i(xi, • • • , x r ) is called the kernel of U n . 

It is obvious that Tr(Cnss) involves the ratio of two [/-statistics according to d9]), which inspires us 
to study Tr(Cjiss) through an asymptotic analysis based on the theory of [/-statistics. 

III. Results with RSS Measurements 

In this section, we endeavor to present an asymptotic analysis of Tr(Cnss) based on the theory of 
[/-statistics on account of the difficulty in conducting an accurate and direct analysis. 

A. Theories 

First of all, we obtain the following lemma for processing the ratio of two U -statistics, which is the 
basis for analyzing Tr(Cjiss) a $ well as the corresponding scalar metrics in other cases. 
Lemma 1: Given {X- , i = 1, • • • , n} and {X^ , i = 1, • • • , n} where 

• {X^ , i = 1, • • • , n} are i.i.d. random variables with bounded values; 

(2) 

• {X\ , i = 1, • • • , n} are i.i.d. random variables with bounded values; 

• {X^ , i = 1, • • • , n} and {xf^ , i = 1, • • • , n} are mutually independent, 

define two-dimensional vectors Xi = [ X^ ] T (i = 1, • • • , ri) and two [/-statistics 



1 n 



n 
t=l 



5„ = - T ^— £ [AT«AfW(X 
n(ro - 1) , .f— f _ L * J 

Then 



nfrt 

l<i<j<n 



(2) _ x (2), 



(13) 



T n 1 2cr? 

^ = + 3^ + M n + (14) 
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where m\ = E{X± ), &i = Std(X^), m,2 = E(sin 2 (Xi — X^')), R n is the remainder term, and 



2A 2 



Y / gi(X i ) + - V 92(X h X j ), (15) 

n ^ n(n — 1) z — ' 

i=l v y l<i<j<n 

ff i(X) = mi , (16) 

1 Xf)+Xf 2X«X« xf^fsin^-Xf) 

ffiiaj = g — - — s4 — • ( 17 ) 

m\m2 m\m2 m\m2 mfm^ 



For any e > 0, as n — > oo, i? n satisfies 

Pr{|nl2n| >e} = Oin- 1 ), (18) 

Pr{|n(lnn)J?„| > e} = o(l). (19) 

Proof: See Appendix ICl ■ 
Remark 4: By Lemma[T] the ratio of two [/-statistics (i.e. ^) can be expanded into a linear expression 

consisting of one constant, one term converging to with the rate 0{n~ l ), one [/-statistic (i.e. M n ) with 

mean and one remainder term (i.e. R n ) converging to in probability. 

Obviously, Tt(Crss) can be expanded based on Lemma [TJ By letting Xf' = jp and x\ 2 ^ = Oj, we 

can derive m-2, = 0.5, and 

mi - A^i)- (20) 



(Tl 



1 /21nA\ 2 

I Ra \ (21) 



M -Rg^ 2 \R 2 -R 2 o)' 

One of our main results is further summarized as follows. 

Theorem 2: Use the same notations b and Tr(Cjiss) as in Section [TT] and mi and <ti as defined by 
d20b and (|2TT) . Define a sequence of random variables 

W n = ( ^(n-l)bmt \ _ ^ _ 2c^ 

V 4ai / (Tl V nm l 

Then, as n — )• oo, W n converges in distribution to a standard normal random variable. 

Proof: See Appendix 151 ■ 
Remark 5: In view of the affine relationship between W n and Tr(Cnss)> it is straightforward that 
Tr(Cjiss) is asymptotically normal. Therefore, given a sufficiently large n, the distribution of Tr(Cjiss) 
can be approximated by the following normal distribution 

M l 4 + ,( ^ W (23) 

\{n — \)bmi n(n — \)bm\ \ yjn{n — l)bm\ J J 
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Most importantly, the above normal random variable enables us to analytically study the performance 
limit, i.e. Tr(Cnss)- Firstly, we can obtain a comprehensive knowledge about how Tr(Cnss) is sta- 
tistically distributed and how Tr(Cjiss) is dependent on n. Secondly, using the normal distribution 
function from (l23l) . we can compute the probability that Tt{Crss) is below a given threshold for a 
known value of n; in turn, we can determine a threshold such that Tr(Cjiss) is below the threshold 
with a certain confidence level, say 0.99; in addition, we can find the minimum n such that Tr(Cjiss) 
is below a given threshold with a certain confidence level. Such analysis is undoubtedly helpful for the 
design and deployment of sensor networks. Lastly, the moments of Tr(Cjiss) can be approximated by 
the corresponding moments of the normal variable defined by (|23T ). namely, 



4 8a? 



EUTr(C RSS )) « t ttt I 7 ^rj-g, (24) 

(n — \)bm\ n(n — l)bmf 



StduiTriCnss)) « 7=73 W 2 > ( 25 ) 



4cji 

\/n(n — l)6m| 

which characterize the relationship among the mean and standard deviation of Tt{Crss)^ the number 
of the anchors, the noise statistics of the RSS measurements and the spatial distributions of the anchors. 

A natural question arises as to how large n should be to obtain a good approximation; this gives rise 
to the convergence rate study. In the literature of U -statistics, the Berry-Esseen bound was developed for 
characterizing the convergence rates of [/-statistics 01! , |[32l . Considering the fact that W n is affine to a 
[/-statistic (i.e. M n ) according to the proof of Theorem |2j we propose the following theorem describing 
the convergence rate of W n in the way similar to the Berry-Esseen bound. 

Theorem 3: Use the notations in Theorem |2] and define 




v z = E\ 

Then, as n — > 00, 



sup|F n (x) - < 



'^ + 2 4\ (x 2 -l)e 



n-2+0{n~ l ) (27) 



60-3* J X/27T 

where F n (x) is the distribution function of W n and 3>(x) is the standard normal distribution function. 
Proof: See Appendix [E] ■ 
Remark 6: Theorem [3] shows that as n — > 00, the density of W n converges to standard normality with 
the rate 0(n 2). Additionally, it can be verified that the coefficient associated with n 2 is a function of 
the ratio ; that is to say, the convergence rate of the density of W n is not determined by the individual 

JL 

Rq ' 



values of Rq and R, but by the ratio -3^. 
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B. Simulations 

In this subsection, we would like to carry out simulations to verify Theorem |2] and the accuracy of 
using (1241 ) and (|25T ). In the simulations, the parameters describing the wireless channel, i.e. a, a^s and 
Ro, are set as 2.3, 3.92 and 1 m, respectively, which were measured in a real environment lfl4l . 

In the first place, we plot the actual distribution functions of Tt(Crss) from simulations (with the 
legend "Simulation") and the normal distribution functions defined by (1231 (with the legend "Formula") 



for n = 5, 10, 15, 20 in Fig. |2(a)| As can be seen, when n = 5, the discrepancy between the pair of 
distribution functions is quite obvious, but when n = 10, the discrepancy becomes very small, and 
when n = 15 or 20, the discrepancy can be negligible. The gradually diminishing discrepancy arises 
for two reasons: the intrinsic error in approximating a [/-statistic by normality, and the existence of the 
remainder term R n which obeys Pr {\Rn\ > -} = 0(n _1 ), see (TT8T ). and though nonzero is neglected 
in the calculation. In Fig. |2(b)[ we plot and compare the pdfs: (i) the overall shapes of the actual pdfs 
(with the legend "Simulation") are quite similar to normality; (ii) the discrepancy in between reduces 
with n increasing too. All those observations are consistent with and in turn demonstrate Theorem [2] 

In the second place, we plot E^^t^rss)) an d Std u {Tr(CRss)) from both simulations and the 
formulas (1241 and d25t in Figs. |3(a)"| |3(c)[ [3(b)] and |3(d) and evidently, the larger is n, the more accurate 
are the formulas. Note that when n = 5, the associated curve corresponding to the actual values of 
Std u (Tr{CRss)) in Fig- |3(b)| is non-smooth, and the most probable reason is that the actual value of 
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10-, 




(a) Simulation Mean 



10-, 




(c) Analytical Mean 




(e) Mean 

Fig. 3. The means and the standard deviations of Tr(Cnss) 
relative errors, with oa,B = 3.92, a — 2.3 and Ro = 1 m. 
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(b) Simulation Standard Deviation 



3-, 




n 



(d) Analytical Standard Deviation 




(f) Standard deviation 
from the simulations and the formulas, and the corresponding 
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Stdu(Tr(Cjiss)) is infinite when n is as small as 5. For better comparison, we plot the relative error\\ 
in Fig. |3(e)| and |3(f)| It can be seen that: (i) E u (Tr(CRss)) is underestimated by ((24b when R is small, 



say R = 2 m, but is overestimated by (|24l) when R is large, say R = 10 m, while the associated 
absolute value of the relative error decreases with n increasing in most cases; (ii) Std^^r^Rss)) is 
underestimated by (l25l) and the associated absolute value of the relative error decreases with increasing 
n and R; (iii) suppose the absolute value of the relative error below 10% is acceptable: when R = 2 m, 
(1241) is applicable if n > 6, but d25l ) is not applicable even if n = 20; when R = 10 m, both (1241 and 
d25l ) are applicable if n > 11. 

In what follows, we present several useful remarks on the properties of sensor localization provided 
that ([24]) and (|25]> are applicable. It is notable that in ([24]) and (|25]>, Eu(Tr(CRss)) and Std^iTr^Rss)) 
normalized by R? (or Rq) are dependent upon the ratio -^; hence, we simplify the discussion involving 
both Rq and R by letting Rq = 1 m and only concentrating on R. 

Remark 7: According to (l24l.E'„i (TriCrfi.g)) is in inverse proportion to n, and thus a critical value 
n* differing from the parameters Ro,R,adB and a can be determined, such that having more anchors 
than n* contributes little to the quality of sensor localization. 

Remark 8: It can be easily deduced that both (1241 and (|25T ) monotonically decrease with R decreasing, 
as illustrated in Fig. |3(c)| and |3(d)[ the reason is that long distance measurements from RSS suffer 
greater errors, and thus produce worse localization performance. Therefore, given a fixed n, distance 
measurements from a sensor are better made at locations as close to the sensor as possible. Moreover, 
it turns out that using more distance measurements spread over a wide range is not necessarily better 
than using fewer distance measurements but spread in a narrow range in terms of E^iTr^Rss))- For 
instance, E u (Tr{CRss)) is approximately 0.52431 m 2 given n = 15 and K = 6m, but is around 0.43174 
m 2 given n = 10 and R = 4 m. Thus, tradeoff should be made between the number of the anchors (i.e. 
n) and their spreading (i.e. Ro and R) in sensor localization. 

Remark 9: Though we have discussed the positive and negative impacts of increasing n and R on 
localization performance separately, the variables are correlated in some situations, and so the impacts 
are related. Normally, increasing all the transmission powers in a wireless sensor network enlarges the 
communication coverage of every node, and both n and R tend to rise; finally, a positive impact is 
embodied, since Tr(CRss) and its mean will definitely decrease according to lfT4l . 

'Relative error is the ratio of the difference between the quantity from simulations and that from a corresponding formula to 
the former one. 
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Remark 10: The dispersion of the distribution of Tr(Cnss) reflects its sensitivity to sensor-anchor 



geometries. Specifically, as illustrated in Fig. |2(b)| with a large dispersion, say n = 5, the chance of having 
two different sensor-anchor geometries to lead to a big difference in the resulting values of Tr(Cnss) is 
large, implying a large sensitivity, and we should be careful about sensor-anchor geometries; by contrast, 
with a small dispersion, say n > 15, the chance is certainly small, so is the sensitivity, and there is 
less reason to worry about sensor-anchor geometries even if the anchors are randomly deployed. Given 
a random variable, the coefficient of variation, defined to be the ratio of its standard deviation to its 
mean, is a normalized measure of dispersion of its distribution. Accordingly, the coefficient associated 
with Tr(Cnss) has the order of 0(n~z); the less is the coefficient, the smaller is the sensitivity. 



C. Expansion to Bearing-only Localization 

A bearing is the angle between a north-south line and a line connecting a sensor to an anchor, and is 
measured in a clockwise direction. In bearing-only localization, bearing measurements associated with 
one sensor and at least two anchors (which are not collinear with the sensor) are required to determine 
the sensor location. We still consider the sensor and n (n > 2) anchors, as shown in Fig. [T] But in our 
study, the angles {aj, i = 1, • • • , n} are assumed to be measured as bearings, due to the fact that this set 
of measurements is equivalent to the set of real bearing measurements as far as our study is concerned. 
Henceforth, we make the following assumption as is commonly used in the studies on bearing-only 
localization ETI. l33l. 

Assumption 3: The bearing measurements = 1, • • • , n} are statistically independent and Gaus- 

sian with means {cti,i = 1, • • • , n} and the same variance a\. 

In bearing-only localization, any anchor should not overlap with the sensor; otherwise, the correspond- 
ing bearing measurement will be invalid. To model a random sensor-anchor geometry, we assume that the 
n anchors are randomly and uniformly distributed inside an annulus centered at the sensor and defined 
by radii Rq and R {Rq < R), as in Assumption [2] in Section ITT] 

Define Fb to be the associated FIM in bearing-only localization, and according to ETT . we have 

1 / v- ,n cos 2 otj v-vn cos Cjj sinc^ \ 

F B = \[ L * =1 L * =1 . (28) 

V i^i=i ^ 2-,i=i — 3T~ / 

Remark 11: Obviously, on replacing by b, Fb will have the same form as the FIM in the RSS 

case, i.e. Frss in ©■ Hence, all the conclusions about Tt(Crss) except those relevant to b are still 

correct in bearing-only localization. 
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IV. Results with TOA Measurements 



Since multi-path fading due to reflection, diffraction and scattering of the radio signal causes variations 
in the received power and severely reduces the accuracy of the RSS ranging approach, RSS-based 
localization normally achieves low performance. By contrast, comparatively accurate distance estimates 
can be obtained using TOA measurements. In this section, we extend our study in the TOA case. 

A. Problem Formulation in the TOA Case 

TOA refers to the travel time of a radio signal through a medium from a single transmitter to a remote 
single receiver. By the relation between signal propagation speed in this medium, the time is a measure for 
the distance between transmitter and receiver. Denote by {Tj, i = 1, • ■ • , n} the measured TOA between 
the sensor and the n anchors. As in both theoretical studies ll34l . ll35l . ll36l and experimental studies |[T4l 
on TOA-based sensor localization under line-of-sight conditions, we assume 

Assumption 4: {Ti,i = 1, • • • ,n} are statistically independent and Gaussian with means = 
1, • • • , n} (c is the speed of propagation) and same variance a\. 

To model the random sensor-anchor geometry, we make the following assumption. 

Assumption 5: The n anchors are randomly and uniformly deployed within a circle of radius R (R > 0) 
centered at the sensor. 

Remark 12: Because the TOA measurement model is still valid when the actual distance d{ is 0, we 
do not need to restrict d{ to be greater than as we do in the RSS case, and simply make Assumption [5] 
It turns out that {di,i = 1, • • • , n} and {cti, i = 1, • • • , n} are mutually independent; the only difference 
from the RSS case is that the pdf of dj is 

Similarly to the RSS case, we can obtain the FIM and the trace of the CRLB matrix in the TOA case 
as follows 



Evidently, Tr(CTOA) is a random variable and is independent of {ck,i = 1, • • • , n}. Furthermore, the 
sufficient conditions in Theorem Q] also hold for the first and second moments of Tr(CxoA)- As before, 
Tt{Ctoa) is used as the scalar metric for the performance limit in the TOA case. 



Tr(C TO A) 




(30) 



(29) 
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B. Theory 

By using Lemma [T] we derive the following theorem. 

Theorem 4: Let a? and Tt{Ctoa) be the same variables as defined in the previous subsection and 
define a sequence of random variables 

V n = ( ^^f ) T ^Ctoa) -2n + 2. (31) 

Then, as n — > oo, V n converges in distribution to a chi-square random variable of degree 2. 

Proof: See Appendix [0 ■ 
Remark 13: According to Theorem HI if n is sufficiently large, the pdf of Tt(Ctoa) can be approx- 
imated by 

n(n — 1) „ ,n(n — 1) 

i^ ( i4^- 2n+2) - (32) 

where f x {-) is the pdf of the chi-square random variable of degree 2. Therefore, we can approximate the 
moments of Tv{Ctoa) by the corresponding moments of the random variable defined by d32~l ), namely, 

E w {Tt(C TO a)) « -^j-, (33) 
n — 1 

ialc 2 

StdUTr(C TOA )) « — i — — 7T- (34) 

nyn — 1) 

C. Simulations 

In the simulations, we let otc = 1 m and plot the distribution functions and pdfs of Tt{Ctoa) from 
both simulations (with the legend "Simulation") and the random variable defined by (|32~1) (with the legend 
"Formula") for n = 5, 10, 15, 20 in Fig. |4] It can be seen that the discrepancies between the pairs of 
distribution functions (or pdfs) are not as obvious as in the RSS case, and also vanish with n increasing, 
which is consistent with Theorem [4] Note that every curve of the pdf from the simulations contains a 
sharp curvature, but the corresponding curve from the given random variable is perfectly smooth; this does 
not invalidate Theorem HI due to the fact that as n increases, the curve associated with Tt{Ctoa) tends 
to become an impulse such that the minimum of Tt{Ctoa) will overlap with the value of Tt{Ctoa) 
attaining the maximum of the pdf. 

Remark 14: From Fig. 4(a)[ we can obtain a clear understanding about Tt{Ctoa)- When n = 5, 



Tt(Ctoa) has a lower limit around 0.8 m 2 and can be far greater than (arc) 2 = 1 m 2 (a^c denotes 
the accuracy of distance measurements from TOA); when n > 10, the lower limit of Tt{Ctoa) reduces 
to be less than 0.4 m 2 and Tr(CTOA) could hardly take values above 1 m 2 . 
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Fig. 4. The distribution functions and pdfs of Tr(CroA) with R = 10m and o^c = lm. 
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Fig. 5. The means and the standard deviations of Tt[Ctoa) from the simulations and formulas, and the corresponding relative 
errors, with R = 10m and ore = lm. 



Next, we plot E u (Tr(CTOA)) an d Std w {Tr{CTOA)) from simulations and approximate formulas d33l) 
and d34l) in Fig. |5(a)[ as well as the associated relative errors in Fig. |5(b)| Although both formulas produce 
overestimates, the relative error of d33l ) is much smaller than that of (f34t . Assuming that a relative error 
below 10% is acceptable, 031 ) is applicable if n > 6, whereas d34l ) is not applicable even if n = 20. 

Remark 15: Since (1331 ) is extraordinarily accurate and shows that Eu(Tr{CTOA)) is inversely propor- 
tional to n, we can determine a critical value n* differing from otc, such that having more anchors than 
n* contributes little to the quality of sensor localization, as in the RSS case. Further, if n is sufficiently 
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large such that (l34l ) is applicable, the coefficient of variation for Tt{Ctoa) approximately equals ^ 
implying that Tt(Ctoa) reduces to with a faster rate than Tt(Crss) as n increases. 



In this paper, we investigated the performance limit of single-hop sensor localization using RSS, TOA 
or bearing measurements by statistical sensor-anchor geometry modeling. That is, the anchor locations 
are assumed to be random, and the scalar metric for the performance limit of sensor localization, i.e. 
the trace of the associated CRLB matrix, consequently becomes random. We came up with formulas 
expressing the asymptotic behavior of the scalar metric in terms of distribution, mean and standard 
deviation. Specifically, as the number of the anchors goes to infinity, the scalar metric in the RSS/bearing 
case is asymptotically normal and its rate of convergence to normality was also derived; in the TOA 
case, the scalar metric converges to a random variable which is an affme transformation of a chi-square 
random variable of degree 2; we presented approximate formulas for means and standard deviations of the 
scalar metric in both the RSS/bearing case and the TOA case. Although these formulas are asymptotic 
in the number of the anchors, they turn out to be remarkably accurate in predicting the performance 
limit of sensor localization in many cases, even if the number of the anchors is fairly small. In addition, 
we analyzed the formulas to demonstrate some general properties of sensor localization and carried out 
extensive simulations to verify the conclusions. 

Considering the similarities between the models for bearing measurements and angle of arrival (AOA) 
measurements, we can easily expand the conclusions in the RSS/bearing case to AOA-based localiza- 
tion. Furthermore, distance measurements in range-only localization are often modeled to be mutually 
independent and Gaussian 1211 . which is the same as occurs with TOA measurements, and thus, it is 
straightforward to expand the conclusions in the TOA case to range-only localization. In future work, 
we may expand our study into three-dimensional space and multi-hop sensor localization. 

Appendix A 
Derivation of Tr(CRss) 
According to the formulation of Frss, we have 



V. Conclusion and Future Work 




(35) 
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Then, 

Tr(C RSS ) 



1 v^n J_ 
b l^i=l dj 

i sin 2 ct ; 

=1 dj 

1 v-^n 1 
6 Z^i=l 



En cos 2 \ ( v-m sin 2 \ ( v-vn cos Qj sin otj 
i=l d 2 J 12^=1 dj ) \l^i=l 



2 

na, \ 

3r 



Ecos Oi sin ctj sin(a,: — Qj ) 
l<ij'<n Sfdj 

1 v^n 1 



E(cos a; sin sin(aj — a.,)+cos ctj sin a; sin(aj — a;)) 
1 



1 v^n 1 

6 Z^i=i dj 



\<i<j<n dfdj 

Appendix B 
Proof of Theorem Q] 

Proof: Let the denominator of Tr(Cjiss) be 

Yn= E feliS>Y (36) 

l<«<j<n \ "* a i / 

Since the numerator of Tt(Crss) is obviously bounded, once the first and second moments of Y~ l are 
finite, the corresponding moments of Tr(Cnss) are finite too. Supposing the pdf of a random variable 
X, denoted f(x), is continuous on [0,+oo), if /(0) = and /'(0) < oo, then E(X~ l ) < oo from ||37l . 
We let f n (x) be the pdf of Y n . To prove E{Y~ l ) < oo, we first show / n (0) = given n > 4 as follows. 



/ .(0) = lim JV t y ^'>- JV t y ^ > 
= lim -[■ [ f . _, , faiptx)--- fa{at n )fd{dx)--- fd(d n )dai---da n ddi---dd n 

J J „(„_!) Z-i<i< j <„^ 535a JS£ 

1 Iff 

< - — — lim -/•••/ / „ s da\ ■ ■ ■ da n 

v ; ^ • / T7i^ry^i<i< J <4^ — 5535 — 

1 Iff 

< - — rj lim -/•••/ fiaidc^dasdc^ 

(2^) 4 £^0 8 J Jj2 l < i<j < i (^(a i -a j ))< ^-^ 

< -r-^j I™ - / / , ^ / . ^ / , ^ da 4 



(2vr) 4 M0£ J- W<ai < n J\ a2 - ai+k7r \< ^(—^^ J\a 3 -a 2 +kn\<y/ ^"-l )R4s J \a 4 -a 3 +kTr\<yf^±^ 

dct3da2dai, k = —2, —1,0,1,2 

lim CKe** 3 - 1 ) = 0. 
£-►0 
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Next, we need to prove f' n {Q) < oo, namely 

f' n (0) = lim lim — / • • • / f a (ai) ■ ■ ■ f a {a n )fd(di) ■ ■ ■ fd{d n )dai • • • da n dd\ ■ ■ ■ dd n < oo. 

(37) 

Because / n (£) is finite for any £, the limit calculus results in 

fn(0) = lim —:/ • • / / Q («i) • • • f a (a n )fd{di) ■ ■ ■ fd{d n )da\ ■ ■ ■ da n ddi ■ ■ ■ dd n . (38) 

£->0,e->0 J Jt<Y n <$+e 

On replacing £ and e by r cos and r sin 0, one has 

= lim-= — „ /•••/ /«(ai) • ■ • fa(a n )fd{di) ■ ■ ■ f d (d n )da 1 ---da n dd 1 -- 

r^O r z COS dsmd J 7 r C o S 0<Y n <r(cos0+ S in0) 

1 1 f f 

< - — — lim -= / • • • / • • • dd„ . 

(27r)« r->0 Hcos0sin6> J ./y n <r(co S 0+sin0) 

The term on the right hand side (RHS) of the above inequality can be treated similarly to proving 
/n(0) = 0, with the result that for n > 5, 

lim — J ■ ■ ■ I dot\ ■ ■ ■ da n < oo (39) 

r->0 et, J J r cos 6><Y„<r(cos 6»+sin0) 

and thus E(Y~ 1 ) < oo. It is straightforward to extend the result about the first moment to the second 
moment by following a similar line of argument, and then the theorem is proved. ■ 

Appendix C 
Proof of Lemma Q] 

Prior to the proof, we review some background about [/-statistics. Firstly, given a U -statistic U n with 
kernel h(x\, • • • ,x r ), it can be rewritten using Hoeffding's (or H-) Decomposition as follows (see ll38l ) 

^ = E j!(n re T j)! E A^,-..,^.) (40) 

j=l l<i 1 <---<ij<n 

where 

Ai(xi) = E{h(x 1 ,X 2 ,--- ,X r )), 

A 2 (xi,x 2 ) = E(h(xi,X2,X 3 , ■ ■ • ,X r )) - Xi(xi) - Xi(x 2 ), 



r-1 

X r (x! ■ ■ ■ ,x r ) = ft(xi, - ,x r ) -} } Xj(x h , ■ ■ ■ ,Xi 3 ). 

j=l l<ii <---<ij <r 



September 19, 2011 



DRAFT 



21 



We define 



and have 



A k = ^2 ^k( x h, ■ ■ ■ ,Xi k ) 

l<ii<—<ik<n 

l<ii<-<ir<n k=l vv J K ' 



At 



(41) 



(42) 



Then, the following lemma concerning can be derived. 

Lemma 2: (Lemma AI in |f39l ) For any given q > 2, if E\h(X\, • • • ,X r )\ q < oo, then there exists 
a positive constant a, which may depend on h(Xi, ■ ■ ■ ,X r ) and the distribution of X\, ■ ■ ■ ,X r but is 
independent of n, such that 



E\A k \ q < an~r 



(43) 



In addition, we can obtain the critical lemma as follows. 

Lemma 3: Let r be a fixed positive integer. For any given q > 2, if E\h{X\, ■ ■ • ,X r )\ q < oo, then 
there exists a constant a independent of n, such that 



E 



n 



£ h(x it ,--- ,x ir ) 



l<ii <---<i r <n 

Proof: From ((421) and c r -inequality, we get 



< an 2 . 



(44) 



E 



n~ r h(X h ,--- ,X ir ] 

l<ii <---<i,,<n 



k=l 



n 



(n-k)\ 



(n — r)!(r — A;)! 



A, 



(45) 



Noting that n 



(n-k)\ 



(n—r)\(r—k)\ 

which are all independent of n, such that 



0(n k ), by Lemma |21 we know that there exist constants ai, • • • ,a r , 



k=l 



n 



(n-k)\ 



(n — r)\(r — k)\ 



Ai 



< J> fc n"^). 



(46) 



k=l 



Thus, by letting a = r max{cti, • • • , o r }, we have 

E 



n~ r Y h(X h ,---,Xi, 

l<ii <"'<i r <n 



< an 2 . 



(47) 



The proof below for Lemma [His in line with that of Theorem 1 in ll39l . but Lemma Q] enhances the 
statement about the remainder term R n by (fT8l) and ( fT9l ) which are key to prove the subsequent theorems. 

Proof: For ease of presentation, we say a random variable satisfies the Condition A if and only if 
it satisfies the same two conditions (i.e. ( fT8l ) and ( fT9l » as R n does in Lemma [TJ The skeleton of the 
proof is: Step 1, applying Taylor expansions on S*" 1 ; Step 2&3, identifying the resulting terms in the 
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expansion of 5" 1 which satisfy the Condition A; Step 4, rewriting S 1 " 1 by combining the terms satisfying 
the Condition A into one term; Step 5, multiplying 5" 1 and T n , identifying the resulting terms which 
satisfy the Condition A and combining them into the remainder term R n . 

Step 1: Let e and q be any given positive real numbers and q > 2. The i^-decomposition of S n is 



2 2 

S n = m s + -Y j Ci(X i ) + - V UXi,Xj] 

n ^ n[n — 1) ^ 

i=l K ' l<i<j<n 



where m s = m\m 2 and 



Ci(X0 = (X\ l) - mi )m x m 2 



(48) 

(49) 
(50) 

(51) 

where < | i? n | < | S n — m s \ . We shall identify the terms on the RHS of (15TI ) satisfying the Condition A. 

Step 2: Firstly, consider \m s + "d n \~ A \S n — m s \ 3 . It follows from (l48l and c r -inequality that for any 
V > 1/3, 



( 2 (Xi, X 3 ) = X^xf* sm 2 (xP - Xf ] ) - {X\ X) + Xf ] ) mi m 2 + m\m 2 . 
Applying Taylor expansions on S" 1 around m s , we obtain 

S" 1 = mj 1 - m~ 2 (S n - m s ) + m~ 3 (S n - m s ) 2 - (m s + ti n )~ 4 (S n - m s ) 3 



\S n -mJ 3p <2 3 P- 1 



£CiPQ) 



1=1 



+ 2 3p- 



n(n — 1) 



- &(Xi>Xj) 



l<i<j<n 



3p 



(52) 



For the first term on the RHS of d52l ), from Lemma [3] and Markov's inequality, we have 



Pr < n 



Pr < n(lnn) 



n 



J2ci(Xi 



i=i 



i=i 



> £ 



> £ 



0{n 



ofl 



(53) 



(54) 



Hence, the first term on the RHS of (152] ) satisfies the Condition A. Similarly, we can prove that the 
second term satisfies the Condition A too. Furthermore, from d54l , the second term on the RHS of d52l ) 
satisfying the Condition A, and 



Pr {n(lnn)\S n - m s \ 3 > e] 



(Inn) 


n 


3 

+ n(lnn) 




i=l 





n(n — 1) 



< Pr < 



2n(ln n) 



i=i 



>-} +Pr { 2n(lnn) 



l<j<j<n 

2 



> e 



n(n — 1) 



l<i<j<ra 
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we have 

Pr {ra(lnn)|S n - m s \ 3 > e} = o(l). 

Similarly, we have 

Pr {n\S n - m s \ 3 > e} = 0(n~*), 

and thus, |5 n — m s \ 3 satisfies the Condition A. 

Based on Lemma [3] and Markov's inequality, we obtain 




= O(n-f), 



>e\ = 0(nri). 
This gives 

Pr{\S n -m s \ >e} =0(n~5), 

and noting that 

Pr\\m s + K\<^\ = Prl n< ~~£] 

< Pr {\S n - m s \ > ^} , 

we have 

Pr{\m s + $ n \ < ^} =0{n~i). 

Furthermore, by (l55l) . (l6Tb and 

Pr |n(lnn)|5 n — m s \ 3 \m s + ?? n |~ 4 > e} 
= Pr{{n{\nn)\S n -m s \ 3 \m s +°d n \- A > e} f|{|m s + tf„| > ^}} 
+Pr{{n(lnn)\S n -m s f\m s + $ n \- 4 > e} f|{|m s + n | < ^}} 

< p r !^n(\nn)\S n - m s \ 3 (^y 4 > + Pr {\m s + $ n \ < ^ } , 

we have 

Pr{n(lnn)|S n - m s \ 3 \m s + ^ n |" 4 } = o(l). 

Similarly, we have 

Pr {n|S n - m s | 3 |m s + tf n |~ 4 > e} = 0(n~f ). 
Hence, we conclude that \S n — m s \ 3 \m s + $ n \~ A satisfies the Condition .4. 
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Step 3: Secondly, we deal with the term (S n — m s ) . 



(S n - m s y 



n 



E\( 1 (x i )\* + ^Y.(^ x rf- E \^ x ^ 2 ) + ^ E (CiWCi^)) 



i=l 



l<i<j<n 



l<i<j<n 



(66) 



n 2 {n — 1) ' 

In the second term on the RHS of (l66l ). CiPQ) 2 — _E|Ci(Xj)| 2 can be regarded as the kernel of a [/-statistic; 
then according to Lemma [3] and Markov's inequality, we have 



Pr < n 



Pr < n(lnn) 



ti- 



ll- 



^2{Cii.Xi) 2 -E\Ci(Xi 



1=1 

n 



8=1 



> e 



> e 



0(n"), 



(67) 



(68) 



and thus, the second term satisfies the Condition A. For the third term on the RHS of (1661) . we have 

8m7 3 



n- 



E Ci(^)Ci(x 



l<i<j'<n 



-^4r E ci^Xi^)-^^ E ciracira. 

n(n — 1) _ , n z (n — 1) _ , 



l<i<J<n 



l<i<j'<n 



(69) 



Based on Lemma [3l we can instantly obtain that the second term on the RHS of the above equation 
satisfies the Condition A. Define 



a; = 

l<i<j'<n 



(70) 
(71) 



From Lemma |2l we can derive 

J5 



l<<j<j<n 



O In" 9 



(72) 



n 2 (n — 1) ' 
v ; i=l 

and thus, the last term on the RHS of (l66l ) satisfies the Condition A. 

Step 4: We rewrite (|5"TT) by combining all the terms which satisfy the Condition A into a new term 
R' n (which, as a consequence, satisfies the Condition A too), and obtain 

S„ 



2mJ 2 



i=l 



n(n — 1) 



Q™— 3 

ram — 1) _ -f— ' , 



l<i<j<n 



(73) 



l<i<j<n 
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Step 5: Similarly to S n , T n can be formulated using 77-Decomposition as follows 

1 " 

T n = m t + - Vn^j) (74) 
n *— ' 

i=l 

where m t = mi and Ti(Xj) = xj; — m\. 

Consider S 1 " 1 ^. Since T n is bounded, it is straightforward that T n 7?^ satisfies the Condition A. Then, 
we multiply every term on the RHS of d73l except 7?^ by the second term on the RHS of d74l and 
identify the resulting terms satisfying the Condition A. Firstly, according to Lemma [3j it is obvious that 
the second term on the RHS of (1731 ) times the second term on the RHS of d74l ) satisfies the Condition 
A. Secondly, the third term on the RHS of (1731) times the second term on the RHS of d74l produces 

O777 ^ 2?r? ^ ^ 

-^EiUX^Xr)) + -L^-^^TOtiM - £(Ci(Xi)n(Xi))) 

i=i 

<y —2 

+J ^t E (CiWnW + CiWnW). 

l<i<j'<n 

In the above expression, the second term satisfies the Condition A from Lemma [3j the third term is 

2?77 2?T7~^ 

—f" E (Ci(^)n(X J ) + Ci(^)ri(^)) = — ^— £ (Ci(*i)n(*i) + Ci(*i)n(*i)) 

\<i<j<n v y l<i<j<n 

~ n2fn s _ n £ (Ci(^)n(X i ) + Ci(^)n(^)),(75) 

and the second term on the RHS of the above equation satisfies the Condition .4 from Lemma [3] Thirdly, 
similarly to the treatments in the last term on the RHS of (l66l ). we can show that the fourth and fifth 
terms on the RHS of ( [731 times the second term on the RHS of d74l ) both satisfy the Condition A. 

Combining all the associated terms satisfying the Condition A in S~ x T n into a new term R n , we then 
prove this lemma. ■ 

Appendix D 
Proof of Theorem [2] 

Proof: We use the notations in Lemma Q] and define a g = Std(g\(Xi)). Moreover, we have 

T *'< c «-» = (m^T)) t- <76) 

It follows from Lemma Q] that M n is a £7 -statistic of degree 2 and 

Pr (|v^n| > -A—} = (77) 

implying that yfnR n converges to in probability. By a g > (due to o\ > 0) and Theorem 1 on page 76 
in |[38l , ^o"~ 1 v / nM ri converges to standard normality as n — > oo. By letting W n = ^<7~ 1 -0T(M Tl + 7? n ), 
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we conclude that as n — > oo, W n converges in distribution to standard normality from Theorem 12 on 
page 16 in EOl. ■ 

Appendix E 
Proof of Theorem [3] 

Proof: Let G n {x) be the distribution function of \o~ x ^JriM n . From Theorem 1.2 in [411], we have 



sup\G n {x)-G(x)\ = 0(n~ 1 ] 



where 



Then, we can obtain 



G{x) = $(x) 



^3 + 



6aj 



6al 



<3? (x)n 2 . 



(78) 



(79) 



\G n (x)-*(x)\ < \G(x)-$(x)\ + \G n (x)-G(x)\ 



< 



. Serf 



mi - <f>"'(x) 



6al 



n 2 + 0(n 1 ). 



(80) 



For any e > 0, using Lemma 3 on page 16 in 1401 . we have 



-a g 1 VnR n 



>£> < F n (x) < G n {x + e) + Pr 



G n {x-e)-Pr 
and thus 

\F n (x) - <f>(x)\ < max{|G n (x ± e) - $(x)|} + Pr 
\F n (x) - $(x)\ < m&x{\G n (x ± e) - $(x ± e)| + |$0 ± e) - $(x)|} + Pr 



> e 



-cr 9 l y/nR r 



-a g l y/nR n 



> e 



By letting e = n 2, we have Pr{\^a g l y/nR n \ > n 2 } = 0(n x ) from Lemma [TJ Moreover, it is 
easy to show ± n~2 ) — <i>(x)| = 0(n _1 ). Then, the theorem is proved. ■ 

Appendix F 
Proof of Theorem |4] 

Proof: We use the same notations as defined in Theorem [2 By letting xjp = 1 and xf^ = oti, we 
obtain a g = 0, m\ = 1, va2 = 0.5 and 

2(J 2 \ T n 



Tr(C'_ 



to A) 



n-lj S n 

The kernel of the U -statistic M n can be expressed as 



(81) 



(82) 



8=1 
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where ipi{a\) = \/2cos2ai and ¥?2(«i) = \/2sin2ai. 



From Theorem 1 on page 79 in ll38l . we derive that nM n converges in distribution to Z\ + Z\ — 2 
where Z\ and Z2 are independent and standard normal, namely that nM n + 2 converges in distribution 
to a chi-square random variable of degree 2. Similarly to the treatments in Theorem |2j we can have 



implying that nR n converges to in probability. By letting V n = n(M n + R n ) + 2, the theorem is 
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